Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation

نویسندگان

  • Yi-An Chen
  • Lokesh P. Tripathi
  • Benoit H. Dessailly
  • Johan Nyström-Persson
  • Shandar Ahmad
  • Kenji Mizuguchi
چکیده

Prioritising candidate genes for further experimental characterisation is an essential, yet challenging task in biomedical research. One way of achieving this goal is to identify specific biological themes that are enriched within the gene set of interest to obtain insights into the biological phenomena under study. Biological pathway data have been particularly useful in identifying functional associations of genes and/or gene sets. However, biological pathway information as compiled in varied repositories often differs in scope and content, preventing a more effective and comprehensive characterisation of gene sets. Here we describe a new approach to constructing biologically coherent gene sets from pathway data in major public repositories and employing them for functional analysis of large gene sets. We first revealed significant overlaps in gene content between different pathways and then defined a clustering method based on the shared gene content and the similarity of gene overlap patterns. We established the biological relevance of the constructed pathway clusters using independent quantitative measures and we finally demonstrated the effectiveness of the constructed pathway clusters in comparative functional enrichment analysis of gene sets associated with diverse human diseases gathered from the literature. The pathway clusters and gene mappings have been integrated into the TargetMine data warehouse and are likely to provide a concise, manageable and biologically relevant means of functional analysis of gene sets and to facilitate candidate gene prioritisation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TargetMine, an Integrated Data Warehouse for Candidate Gene Prioritisation and Target Discovery

Prioritising candidate genes for further experimental characterisation is a non-trivial challenge in drug discovery and biomedical research in general. An integrated approach that combines results from multiple data types is best suited for optimal target selection. We developed TargetMine, a data warehouse for efficient target prioritisation. TargetMine utilises the InterMine framework, with n...

متن کامل

Prioritising integrated care initiatives on a national level. Experiences from Austria

INTRODUCTION AND BACKGROUND Based on a policy initiative and the foundation of the Competence Centre for Integrated Care by the Austrian Social Security Institutions in 2006, the aim of the project was to identify and prioritise potential diseases and target groups for which integrated care models should be developed and implemented within the Austrian health system. The project was conducted a...

متن کامل

Functional Categories Associated with Clusters of Genes That Are Co-Expressed across the NCI-60 Cancer Cell Lines

BACKGROUND The NCI-60 is a panel of 60 diverse human cancer cell lines used by the U.S. National Cancer Institute to screen compounds for anticancer activity. In the current study, gene expression levels from five platforms were integrated to yield a single composite transcriptome profile. The comprehensive and reliable nature of that dataset allows us to study gene co-expression across cancer ...

متن کامل

Long Waiting Times for Elective Hospital Care – Breaking the Vicious Circle by Abandoning Prioritisation

Background Policies assigning low-priority patients treatment delays for care, in order to make room for patients of higher priority arriving later, are common in secondary healthcare services today. Alternatively, each new patient could be granted the first available appointment. We aimed to investigate whether prioritisation can be part of the reason why waiting times for care are often...

متن کامل

Prediction of human protein-protein interaction by a mixed Bayesian model and its application to exploring underlying cancer-related pathway crosstalk.

Protein-protein interaction (PPI) prediction method has provided an opportunity for elucidating potential biological processes and disease mechanisms. We integrated eight features involving proteomic, genomic, phenotype and functional annotation datasets by a mixed model consisting of full connected Bayesian (FCB) model and naive Bayesian model to predict human PPIs, resulting in 40 447 PPIs wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2014